Data-146

View on GitHub

Download the anonymized dataset describing persons.csv from a West African county and import it into your PyCharm project workspace (right click and download from the above link or you can also find the data pinned to the slack channel). First set the variable wealthC as your target. It is not necessary to set a seed.

When running a linear regression of the persons.csv data with wealthC set as the target data I got the following results

I then standardized the data and got the following results

Standardizing actually created a weaker training and testing and a small increase in MSE so standardizing the data was unsuccessful in finding a better result.

I than ran a ridge regression with the data to try and find better results.

The results from the ridge regression are currently the best results.

I then ran a lasso regression

In conclusion the Lasso and Ridge regression are equally good and more predictive then a simple linear regression.

I then ran the code with the variable wealth I

I then changed the target value from the ordinal data of wealthC to the continuous data of wealthI I got the following results

I then standardized the data and got the following

I then ran a ridge regression on the data and received the following.

Lastly I ran a Lasso Regression and received the following results.